1. From any browser, access the PhosphoSitePlus website (www.phosphosite.org) 
a. Navigate to Downloads > Datasets from PSP 
b. Accept the terms and conditions 
c. Download the Methylation_site_dataset.gz file 
d. Open the file using Microsoft Excel (decompress the .gz archive first if needed) 


2. From the main file, duplicate the sheet (rename "Lysine Methylation") 
a. Sort MOD_RSD column alphabetically, and keep only rows with K methylation 
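For readers who prefer a script to a spreadsheet, steps 1d to 2a could be sketched roughly as follows. This is only an illustration, not the procedure used in our analysis. It assumes the download is a gzip-compressed, tab-delimited, UTF-8 text file in which some preamble lines may precede the header row, and that MOD_RSD values begin with the one-letter code of the modified residue (for example K8-m1). The function names read_rows and lysine_rows are hypothetical.

```python
import csv
import gzip

def read_rows(path):
    """Read a gzip-compressed, tab-delimited table into a list of dicts.

    Assumption: any preamble lines come before the header row, which is
    recognised here by the presence of a MOD_RSD column name.
    """
    with gzip.open(path, "rt", encoding="utf-8", newline="") as handle:
        lines = handle.read().splitlines()
    header = next(i for i, line in enumerate(lines)
                  if "MOD_RSD" in line.split("\t"))
    return list(csv.DictReader(lines[header:], delimiter="\t"))

def lysine_rows(rows):
    """Keep rows whose MOD_RSD starts with K, sorted by MOD_RSD as in step 2a."""
    kept = [row for row in rows if row["MOD_RSD"].startswith("K")]
    return sorted(kept, key=lambda row: row["MOD_RSD"])

# Example call (the file name is the one given in step 1c):
# lysine = lysine_rows(read_rows("Methylation_site_dataset.gz"))
```

If the real file uses a different encoding or layout, the two assumptions above are the places to adjust.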


3. For KxP methylation sites: 
a. Create a new duplicate sheet (rename "KxP methylation") 
b. Use the SITE_+/-_7AA column for the searches below (in our spreadsheet, it was column J) 
c. Create a column "K?P" --> use function =SEARCH("K?P",J2) and then drag down to create the column for all rows 
i. In our spreadsheet, it was column K 
ii. This will provide all AA sequences that have a KxP motif, with the position of the K indicated as a number 
d. Sort column "K?P" in ascending order, and keep only rows with values present 
e. Create another column "K?P" --> use function =MID(J2,SEARCH("K?P",J2),3) and drag down to create a column for all rows 
i. In our spreadsheet, it was column L 
ii. This will provide all the KxP motifs from the SITE_+/-_7AA column 
f. Create another column "k?P" --> use function =FIND("k", L2) and drag down to create a column for all rows 
i. In our spreadsheet, it was column M 
ii. This will provide a value of 1 if the lysine from the KxP is lowercase, i.e., it is the site that is being modified 
iii. If the motif from column L has a methylated lysine at another position, it will return 2 or 3 
iv. If the motif from column L contains only a capitalized K (no lowercase k), then this column will return #VALUE! 
g. Create a new duplicate sheet (rename "KxP no repeats") 
i. Delete all rows in which the "k?P" column returned #VALUE! 
ii. Sort sheet by GENE column (in our spreadsheet, it was column A) 
iii. Use the Data --> Remove Duplicates function and select column A 
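The three formula columns in steps 3c to 3f can be mirrored in a short script, which may help when checking what each column is expected to contain. The sketch below is an approximation under stated assumptions: a case-insensitive regular expression stands in for SEARCH, the matched text stands in for MID, and str.find stands in for the case-sensitive FIND. The gene names and sequence windows are made up for illustration, and the helper names first_motif and lowercase_k_position are hypothetical.

```python
import re

# Mirrors SEARCH("K?P", ...): case-insensitive, with ? matching any one character.
KXP = re.compile(r"K.P", re.IGNORECASE)

def first_motif(window):
    """Return (1-based position, matched text) for the first KxP hit, else None."""
    match = KXP.search(window)
    if match is None:
        return None  # Excel would show #VALUE! here
    return match.start() + 1, match.group(0)

def lowercase_k_position(motif):
    """Mirror FIND("k", ...): 1-based position of the first lowercase k, else None."""
    index = motif.find("k")
    return index + 1 if index >= 0 else None

# Made-up 15-residue windows; the modified residue is the lowercase letter.
windows = {
    "GENEA": "AAAAAAAkGPAAAAA",  # modified k starts the motif
    "GENEB": "AAAAAAKkPAAAAAA",  # modified k sits inside a KxP motif
    "GENEC": "AAKAPAAkAAAAAAA",  # KxP motif present, but its K is not modified
    "GENED": "AAAAAAAkAAAAAAA",  # no KxP motif at all
}

results = {}
for gene, window in windows.items():
    hit = first_motif(window)
    if hit is None:
        continue  # step d: drop rows without a value
    position, motif = hit
    results[gene] = (position, motif, lowercase_k_position(motif))
```

Like SEARCH, the regular expression reports only the first match in each window, so a second KxP motif later in the same window is not examined.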


4. For KxxG methylation sites: 
a. Create a new duplicate sheet from the "Lysine Methylation" sheet (rename "KxxG methylation") 
b. Repeat above steps using "K??G" instead of "K?P", and "k??G" instead of "k?P" 
c. Retain only the rows in which the "k??G" column, =FIND("k",L2), returned a value of 1 
d. Create the sheet "KxxG no repeats" (sort by GENE and remove duplicates, as in step 3g) 
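Because step 4 reuses the step 3 logic with a different wildcard pattern, a scripted version could take the pattern as a parameter. The sketch below is one possible way to do this, not the method used in our analysis. It assumes that ? is the only wildcard needed and that the whole matched motif is examined (four characters for K??G). The names wildcard_to_regex and modified_k_leads_motif are hypothetical, and the sequence windows are made up.

```python
import re

def wildcard_to_regex(pattern):
    """Translate a pattern in which ? means any single character.

    The result is case-insensitive, as Excel's SEARCH is.
    """
    body = "".join("." if ch == "?" else re.escape(ch) for ch in pattern)
    return re.compile(body, re.IGNORECASE)

def modified_k_leads_motif(window, pattern):
    """True when the first match of the pattern begins with a lowercase k.

    This is equivalent to FIND("k", motif) returning 1.
    """
    match = wildcard_to_regex(pattern).search(window)
    return match is not None and match.group(0).startswith("k")

# Made-up windows; the modified residue is the lowercase letter.
keep = modified_k_leads_motif("AAAAAAAkAAGAAAA", "K??G")   # motif kAAG
drop = modified_k_leads_motif("AAAKAAGkAAAAAAA", "K??G")   # motif KAAG
```

The same function can be called with "K?P" to reproduce the stricter form of the step 3 filter, in which only rows returning 1 are retained.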


5. Make a new sheet (rename "KxP_KxxG") 
a. Copy all rows from the "KxP no repeats" and "KxxG no repeats" sheets 
b. Create a new duplicate sheet (rename "KxP_KxxG no repeats") 
i. Sort by GENE (column A) 
ii. Use the Data --> Remove Duplicates function and select column A 
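The combine, sort, and remove-duplicates sequence in step 5 (and the equivalent de-duplication in step 3g) amounts to keeping one row per gene. A minimal sketch of that idea follows, assuming each row is a dict with a GENE key and that the first row encountered for a gene is the one retained, as Excel's Remove Duplicates does. The function name merge_unique_by_gene and the example rows are hypothetical.

```python
def merge_unique_by_gene(*sheets):
    """Combine row lists, sort by GENE, and keep the first row for each gene."""
    combined = [row for sheet in sheets for row in sheet]
    combined.sort(key=lambda row: row["GENE"])  # stable: ties keep input order
    seen = set()
    unique = []
    for row in combined:
        if row["GENE"] not in seen:
            seen.add(row["GENE"])
            unique.append(row)
    return unique

# Made-up rows standing in for the "KxP no repeats" and "KxxG no repeats" sheets.
kxp = [{"GENE": "GENEB", "motif": "kAP"}, {"GENE": "GENEA", "motif": "kGP"}]
kxxg = [{"GENE": "GENEA", "motif": "kAAG"}, {"GENE": "GENEC", "motif": "kSTG"}]

merged = merge_unique_by_gene(kxp, kxxg)
```

Because the sort is stable, a gene present in both inputs keeps the row from whichever sheet was passed first.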


The spreadsheet used in our analysis is available upon request once the protocol has been utilized (please contact cheryl.walker@bcm.edu) 